# German Speech Recognition
Whisper Medium Cv11 German Ct2
Apache-2.0
An automatic speech recognition model based on OpenAI's whisper-medium, fine-tuned on the Common Voice 11.0 German dataset.
Speech Recognition
Transformers German

mkenfenheuer
21
1
Whisper Large V3 Turbo German Ct2
Apache-2.0
A German speech recognition model based on Whisper Large v3, optimized for German speech processing and recognition
Speech Recognition
Transformers German

jimmymeister
38
3
Whisper Large V3 Turbo German
Apache-2.0
A German speech recognition model fine-tuned from Whisper Large v3.
Speech Recognition
Transformers German

primeline
2,777
33
Distil Whisper Large V3 German
Apache-2.0
A German speech recognition model based on distil-whisper technology, with 756 million parameters, achieving faster inference speeds while maintaining high quality.
Speech Recognition
Transformers German

primeline
207
15
Whisper Tiny German
Apache-2.0
A German speech recognition model based on whisper-tiny, with 37.8 million parameters, suitable for edge scenarios sensitive to model size.
Speech Recognition
Transformers German

primeline
198
8
Whisper Large V3 German
Apache-2.0
A fine-tuned German speech recognition model based on Whisper Large v3, optimized for German speech processing and recognition
Speech Recognition
Transformers German

primeline
8,745
70
Stt De Fastconformer Hybrid Large Pc
A German automatic speech recognition model based on the FastConformer architecture, trained with a hybrid Transducer and CTC objective, with approximately 115M parameters.
Speech Recognition German
nvidia
1,017
4
Whisper Large V2 Cv11 German
Apache-2.0
An automatic speech recognition model based on openai/whisper-large-v2, fine-tuned on the Common Voice 11.0 German dataset, supporting German speech-to-text with a word error rate of 5.76%.
Speech Recognition
Transformers German

bofenghuang
179
16
Stt De Conformer Transducer Large
This is a large Conformer-Transducer model for German automatic speech recognition, with approximately 120 million parameters, supporting the transcription of German speech into text.
Speech Recognition German
nvidia
66
6
Stt De Conformer Ctc Large
This is a large-scale Conformer-CTC model for German automatic speech recognition, trained and optimized by NVIDIA on thousands of hours of German speech data.
Speech Recognition German
nvidia
132
4
Wav2vec2 Large Xlsr 53 German Cv9
Apache-2.0
This is an automatic speech recognition (ASR) model fine-tuned on the German Common Voice 9.0 dataset, based on Facebook's wav2vec2-large-xlsr-53 model.
Speech Recognition
Transformers German

oliverguhr
98
1
Wav2vec2 Xls R 1b Tevr
Apache-2.0
This is a German speech recognition model based on the wav2vec 2.0 XLS-R 1B architecture, incorporating TEVR (Token Entropy Variance Reduction) technology and combined with a 5-gram language model. It achieves a word error rate of 3.64% on the Common Voice German test set.
Speech Recognition
Transformers German

fxtentacle
311
14
Wav2vec2 Large Xls R 300m German With Lm
Apache-2.0
A speech recognition model based on facebook/wav2vec2-xls-r-300m, fine-tuned on the Common Voice German dataset and integrated with an n-gram language model, achieving a word error rate of 8.8%.
Speech Recognition
Transformers

mfleck
26
1
Wav2vec2 Large Xlsr 53 German
Apache-2.0
An automatic speech recognition model based on facebook/wav2vec2-large-xlsr-53, fine-tuned on the Common Voice German dataset, achieving a test WER of 15.80%.
Speech Recognition German
marcel
25
1
Wav2vec2 100m Mls German Ft
Apache-2.0
An automatic speech recognition (ASR) model based on facebook/wav2vec2-xls-r-100m, fine-tuned on the German subset of the Multilingual LibriSpeech dataset.
Speech Recognition
Transformers

patrickvonplaten
27
0
German Pretrained
Apache-2.0
A German speech recognition model fine-tuned from flozi00/wav2vec-xlsr-german, with a reported word error rate of 1.0 on the evaluation set.
Speech Recognition
Transformers

chaitanya97
30
0
Wav2vec2 Xls R 1b German
Apache-2.0
This is a German automatic speech recognition model based on the XLS-R 1B architecture, fine-tuned on multiple German speech datasets including Common Voice 8.0
Speech Recognition
Transformers German

jonatasgrosman
105
3
Wav2vec2 Xls R 1b De Cv8
Apache-2.0
An automatic speech recognition model based on facebook/wav2vec2-xls-r-1b, fine-tuned on the Common Voice 8 German dataset.
Speech Recognition
Transformers German

jsnfly
22
0
Wav2vec2 100m Mls German Ft 2
Apache-2.0
A German automatic speech recognition model based on facebook/wav2vec2-xls-r-100m, fine-tuned on the MULTILINGUAL_LIBRISPEECH - GERMAN dataset.
Speech Recognition
Transformers

patrickvonplaten
23
0
Wav2vec2 Large Xlsr German Demo
Apache-2.0
A speech recognition model based on facebook/wav2vec2-large-xlsr-53, fine-tuned on the German Common Voice dataset, with a word error rate of 29.35%.
Speech Recognition German
marcel
23
1
Wav2vec2 Large Xlsr 53 German Gpt2
Apache-2.0
This is an automatic speech recognition encoder-decoder model trained on the MOZILLA-FOUNDATION/COMMON_VOICE_7_0 German dataset, combining the strengths of Wav2Vec2 and GPT2 architectures.
Speech Recognition
Transformers German

jsnfly
28
2
Wav2vec2 Xlsr 300m German Truecase
A German speech recognition model based on Facebook's wav2vec2-xls-r-300m, fine-tuned on the Common Voice German dataset, that preserves letter case in its transcriptions.
Speech Recognition
Transformers

abnerh
16
1
Wav2vec2 Large Xlsr 53 German With Lm
Apache-2.0
This is a German automatic speech recognition model based on the XLSR Wav2Vec2 architecture with language model support, excelling on the Common Voice German dataset.
Speech Recognition
Transformers German

aware-ai
19
7
Wav2vec2 Base 10k Voxpopuli Ft De
A speech recognition model based on Facebook's Wav2Vec2 base model, pretrained on a 10K-hour unlabeled subset of the VoxPopuli corpus and fine-tuned on German transcription data
Speech Recognition
Transformers German

facebook
46
1
Wav2vec2 Large Xlsr 53 German
Apache-2.0
Large-scale German automatic speech recognition (ASR) model based on Facebook's Wav2Vec2 architecture, fine-tuned on the Common Voice German dataset
Speech Recognition German
facebook
1,767
3
German Trained
Apache-2.0
A German speech recognition model fine-tuned from flozi00/wav2vec-xlsr-german, primarily used for German speech-to-text tasks.
Speech Recognition
Transformers

chaitanya97
24
0
Wav2vec2 Xls R 300m German De
Apache-2.0
A German automatic speech recognition (ASR) model obtained by fine-tuning facebook/wav2vec2-xls-r-300m on the MOZILLA-FOUNDATION/COMMON_VOICE_7_0 - DE dataset.
Speech Recognition
Transformers German

AndrewMcDowell
72
3
Wav2vec2 Xls R 1B German
Apache-2.0
This model is a fine-tuned version of facebook/wav2vec2-xls-r-1b on the MOZILLA-FOUNDATION/COMMON_VOICE_8_0 - German dataset for German automatic speech recognition tasks.
Speech Recognition
Transformers German

AndrewMcDowell
48
1
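
Several entries above quote a word error rate (WER). As a rough illustration of what that figure measures, the sketch below computes WER as the word-level edit distance (substitutions, deletions, and insertions) divided by the number of reference words. The function name `word_error_rate` and the example sentences are illustrative assumptions, not taken from any of the listed model cards; published figures usually also apply text normalization (casing, punctuation), which this sketch omits.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER = word-level edit distance / number of reference words.

    Hypothetical helper for illustration only; no text normalization is applied.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words processed
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                      # deletion of a reference word
                curr[j - 1] + 1,                  # insertion of a hypothesis word
                prev[j - 1] + substitution_cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One substitution ("ein" -> "kein") and one deletion ("kleiner")
    # over five reference words.
    print(word_error_rate("das ist ein kleiner test", "das ist kein test"))
```

Listings may state WER either as a fraction or as a percentage, so it is worth checking each model card for the convention it uses before comparing figures.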